CDS

Accession Number TCMCG075C22271
gbkey CDS
Protein Id XP_017980331.1
Location complement(join(8596300..8596374,8597021..8597105,8597794..8598014,8598869..8599251,8599562..8600177,8600276..8600335,8600490..8600556,8601396..8601492,8601590..8601703,8602115..8602169,8602273..8602414,8602893..8603701))
Gene LOC18594416
GeneID 18594416
Organism Theobroma cacao

Protein

Length 907aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018124842.1
Definition PREDICTED: SART-1 family protein DOT2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category A
Description U4 U6.U5 tri-snRNP-associated protein
KEGG_TC -
KEGG_Module M00354        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
KEGG_ko ko:K11984        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03040        [VIEW IN KEGG]
map03040        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGATAAGGATAGGTATGATAGGGAAGATGATGTTTCTAGAGAACGCTGGGATGGAGGAGCCTACAGTGATGAATTAGAGCAAAACGATAAGCATCGGAGTAAGGACAAGAAGAAGAGCAGCCTGGAAGAAGAAAAGGATCATCGAAGTAGGGATAGAGAGCGGGACCGTTCAAAGAGAAGTAATGATGAGATCTTGAAAGAAAGGGAGAAAGATTTTAAAGATTTGGAAAAGGATAGAGTGTCGAGCAGGGAAAGGAGGAAGGATGATAGAGATGAACATGGAAAGGACAGAAGTAGAGATAGTAAGGTGAGAGAAAAAGAGAAAGATTATGATAGGGACAAGTATAGAGAAAAAGAACATGAGCGTGAGAGAGAAAAGGATCGAAAGGATCGAGGAAAGGAAAAGGACAGGGAAAGGGGTAGAGACTCTGAGAAGGAGAGGGGGAAGGATAAAGGTAGAGATAGGGATAGAGAAAAGGAAAAGGAAAGAGACAAGGCCAAGGAAAGAGAGAAAAAGGATCGAGAGAAGGAGAGGGAAGGTGAAAAGGATAGGGATAGAGATAGAGAGAAGGGAAAGGAGAGAAGTAAACAGAAAAGTAGAGAGGCAGACCTAGAGAAGGAGAGATCAAGAGATAGGGATAATGCAATCAAGAAAAACCATGAGGAAGATTATGAAGGAAGCAAAGATGGAGAGCTTGCATTAGACTATGGAGACAGTAGGGATAAGGATGAAGCTGAATTGAATGCTGGCAGCAATGCAGGTGTAGCACAGGCATCATCATCAGAGCTTGAGGAGCGCATTGCAAGAATGAAAGAAGAAAGATTGAAAAAGAAATCTGAAGGTGTTTCAGAGGTTTTAGAATGGGTTGGTAACTTTCGTAAGCTTGAGGAGAAAAGGAATGCTGAAAAGGAGAAAGCCTTGCAGCGATCAAAGATTTTTGAGGAGCAGGATGATTTTGTTCAAGGGGAAAATGAAGATGAGGAGGCTGTCCGTCATGCTGCTCATGATCTAGCAGGGGTTAAAGTTCTTCATGGCCTTGACAAAGTGATGGATGGTGGAGCTGTTGTTTTGACACTCAAAGATCAGAGCATACTTGCTAATGGTGACATTAATGAAGATGTTGATATGCTTGAAAATGTTGAAATTGGAGAGCAGAGGAGGCGGGATGAGGCTTACAAGGCTGCAAAGAAAAAAACCGGGGTTTATGATGATAAGTTCAATGATGAGCCTGGTTCAGAGAAAAAAATACTTCCTCAATATGACAATCCAGTTGCAGACGAGGGGGTAACTCTGGATGAAAGGGGGCGCTTTACCGGTGAAGCAGAAAAGAAATTGCAGGAGCTCCGTAAAAGGTTACAAGGTGTTCCCACAAATAACCGTGTTGAAGATCTTAACAACGCTGGGAAGATTGCATCAGATTATTATACCCAAGAGGAAATGCTTAAATTTAAAAAGCCCAAAAAAAAGAAAGCTTTGCGGAAGAAAGAGAAGCTGGATATAGATGCCCTTGAAGCAGAAGCTATCTCTTCAGGCCTGGGAGCTGGAGATCTTGGTTCTAGAAATGATGCTAGAAGACAGGCAATTAGAGAGGAGGAGGCCAGATCTGAGGCTGAAAAGAGAAATAGTGCATACCAATCTGCATATGCCAAGGCAGATGAGGCATCTAAATCACTGTGGCTTGAACAAACTCTTATAGTTAAACCAGAGGAAGATGAAAATCAAGTGTTTGCCGATGATGATGATGATCTTTATAAATCCATTGAAAGATCAAGGAAATTAGCTTTTAAAAAGCAAGAAGATGAAAAATCAGGTCCTCAAGCTATTGCGCTCCGTGCTACCACAGCTGCTATCAGTCAAACTGCAGATGATCAAACTACCACAACTGGAGAGGCACAAGAAAACAAGCTTGTAATAACAGAGATGGAGGAGTTTGTATGGGGTCTTCAGCATGATGAAGAAGCCCATAAACCTGACAGTGAAGATGTTTTTATGGATGAGGATGAAGTGCCGGGAGTTTCTGAACATGATGGAAAAAGTGGAGAAAATGAGGTAGGTGGATGGACAGAAGTAGTTGATGCTAGTACAGATGAAAACCCTTCCAATGAGGACAAGGATGATATAGTTCCAGATGAAACCATTCATGAAGTTGCAGTTGGTAAAGGATTATCAGGTGCCCTGAAGCTGCTTAAAGATCGAGGAATGCTTAAAGAAAGTATTGAATGGGGTGGCAGGAACATGGACAAGAAAAAGAGCAAACTTGTTGGCATTGTAGATGATGATCGTGAAAATGATAGATTTAAAGATATTCGCATTGAGAGGACAGATGAATTTGGTCGAATTATAACTCCCAAGGAAGCCTTCCGGGTGCTTTCTCATAAATTTCATGGCAAGGGGCCTGGCAAAATGAAGCAAGAGAAACGGCAGAAGCAATATCAGGAAGAATTGAAGCTGAAGCAAATGAAAAATTCTGATACACCTTCCCTGTCAGTGGAGAGGATGAGGGAAGCTCAAGCTCAGCTGAAAACACCCTACCTTGTCCTTAGTGGTCATGTGAAACCAGGGCAAACGAGTGATCCTAGAAGTGGCTTTGCTACTGTTGAGAAGGATTTTCCAGGAGGCTTGACACCTATGCTTGGTGATAGAAAGGTTGAGCATTTCCTTGGAATTAAGCGCAAGGCTGAGCCAGGGAATTCAAGCACACCAAAGAAGCCTAAAACCTGA
Protein:  
MDKDRYDREDDVSRERWDGGAYSDELEQNDKHRSKDKKKSSLEEEKDHRSRDRERDRSKRSNDEILKEREKDFKDLEKDRVSSRERRKDDRDEHGKDRSRDSKVREKEKDYDRDKYREKEHEREREKDRKDRGKEKDRERGRDSEKERGKDKGRDRDREKEKERDKAKEREKKDREKEREGEKDRDRDREKGKERSKQKSREADLEKERSRDRDNAIKKNHEEDYEGSKDGELALDYGDSRDKDEAELNAGSNAGVAQASSSELEERIARMKEERLKKKSEGVSEVLEWVGNFRKLEEKRNAEKEKALQRSKIFEEQDDFVQGENEDEEAVRHAAHDLAGVKVLHGLDKVMDGGAVVLTLKDQSILANGDINEDVDMLENVEIGEQRRRDEAYKAAKKKTGVYDDKFNDEPGSEKKILPQYDNPVADEGVTLDERGRFTGEAEKKLQELRKRLQGVPTNNRVEDLNNAGKIASDYYTQEEMLKFKKPKKKKALRKKEKLDIDALEAEAISSGLGAGDLGSRNDARRQAIREEEARSEAEKRNSAYQSAYAKADEASKSLWLEQTLIVKPEEDENQVFADDDDDLYKSIERSRKLAFKKQEDEKSGPQAIALRATTAAISQTADDQTTTTGEAQENKLVITEMEEFVWGLQHDEEAHKPDSEDVFMDEDEVPGVSEHDGKSGENEVGGWTEVVDASTDENPSNEDKDDIVPDETIHEVAVGKGLSGALKLLKDRGMLKESIEWGGRNMDKKKSKLVGIVDDDRENDRFKDIRIERTDEFGRIITPKEAFRVLSHKFHGKGPGKMKQEKRQKQYQEELKLKQMKNSDTPSLSVERMREAQAQLKTPYLVLSGHVKPGQTSDPRSGFATVEKDFPGGLTPMLGDRKVEHFLGIKRKAEPGNSSTPKKPKT